Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
RL, reward functions, policy gradient, agents, simulation
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
5331
posts in
10.8
ms
Don’t Let
Reinforcement
Learning Act
Alone
pub.towardsai.net
·
2d
🤖
Game AI
MolmoBot
: Training robot manipulation
entirely
in simulation
allenai.org
·
1d
🎭
Anthropic Claude
rag not lag:
rl
for
blazing
fast agentic retrieval
cgft.io
·
3d
·
Discuss:
Hacker News
🎯
Retrieval Systems
CUDA
AgentCUDA
Agent: Large-Scale Agentic RL for High-Performance
CUDA
Kernel Generation
arxiv.org
·
1d
🎲
Procedural Generation
Free
Simulation
and
Modeling
in your Browser
insightmaker.com
·
2d
⚡
Productivity
Systematic
debugging for AI agents: Introducing the
AgentRx
framework
microsoft.com
·
8h
🎯
AI Agents
Reinforcing
the World's Edge: A
Continual
Learning Problem in the Multi-Agent-World Boundary
arxiv.org
·
2d
🤖
Game AI
Every minute you
aren
’t running 69 agents, you are
falling
behind
geohot.github.io
·
2d
·
Discuss:
Hacker News
♟️
Game Theory
Designing
AI agents to
resist
prompt injection
openai.com
·
1d
·
Discuss:
Hacker News
🛡️
AI Security
AI Agents: Why the Gap Between Demo and
Deployment
Keeps
Widening
hackernoon.com
·
1d
🎯
AI Agents
How
AlphaEvolve
Discovers
New Multi‑Agent Learning Algorithms
pub.towardsai.net
·
21h
🎯
AI Agents
Containerized
Hosts
for AI Agents
coasts.dev
·
1d
🎯
AI Agents
TIL a LLM-based simulation of a specific person (like
Grammarly
does without asking) is called a “
sloppelgänger
”
hachyderm.io
·
1d
🧠
LLM
Intelligence as a
Commodity
psychologytoday.com
·
5h
🎯
Decision Making
Ai2
Introduces Open, Simulation-First Stack for Physical AI,
Achieving
Zero-Shot Transfer to Real Robots
allenai.org
·
1d
🛡️
AI Safety
Writing an LLM from scratch, part
32e
–
Interventions
: the learning rate
gilesthomas.com
·
2d
·
Discuss:
Hacker News
🧠
LLM
AI in 2030: What Today’s
Developers
Are Building for
Tomorrow
hackernoon.com
·
1d
🎭
Anthropic Claude
From
raw
interaction to
reusable
knowledge: Rethinking memory for AI agents
microsoft.com
·
2d
✍️
Prompt Engineering
e2b-dev/awesome-devins
: Awesome Devin-inspired AI agents
github.com
·
1d
✍️
Prompt Engineering
The AI-Ready Software Developer #21 – Stuck In A “
Doom
Loop
”? Drop A Gear
codemanship.wordpress.com
·
1d
✍️
Prompt Engineering
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help